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Abstract 

The paper discusses various applications of permutation group theory 
in the synthesis of reversible logic circuits consisting of Toffoli gates with 
negative control lines. An asymptotically optimal synthesis algorithm for 
circuits consisting of gates from the NCT library is described. An algo¬ 
rithm for gate complexity reduction, based on equivalent replacements of 
gates compositions, is introduced. A new approach for combining a group- 
theory-based synthesis algorithm with a Reed-Muller-spectra-based syn¬ 
thesis algorithm is described. Experimental results are presented to show 
that the proposed synthesis techniques allow a reduction in input lines 
count, gate complexity or quantum cost of reversible circuits for various 
benchmark functions. 

Keywords: reversible logic, synthesis, permutation group theory. 


1 Introduction 

Reversible logic circuits have been studied in many recent papers [1], [10], [12], 
[14], [18]. On the one hand, the interest in these circuits is caused by the 
theoretically possible reduction of energy consumption in digital devices due 
to the reversibility of all computations [4]. On the other hand, all quantum 
computations are necessarily reversible. Hence, with the help of a reversible 
circuit, one can model a quantum circuit. 

One important research area is the development of new efficient and fast 
synthesis algorithms, which can produce a reversible circuit with low gate com¬ 
plexity and depth. However, for the purpose of a comparison between different 
synthesis algorithms, we should first choose a library of gates, from which a 
synthesized circuit will consist. One such gate library is one that includes NOT 
(inversion gate), CNOT (Feynman gate) and C 2 NOT (Toffoli gate). We will 
refer to it as the NCT library. Another popular gate library is the GT library, 
which includes generalized Toffoli gates with positive and negative control input 
lines. Both libraries are functionally complete in terms of the ability to con¬ 
struct a reversible circuit that implements a desired even permutation from the 
alternating group A(B ra ) without using additional inputs. An odd permutation 
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from the symmetric group S^B^) can always be realized in a reversible circuit 
without additional inputs in the GT, but not in the NCT library. 

For many proposed synthesis algorithms, an upper bound for the gate com¬ 
plexity of a reversible circuit in the worst case is proved. Though it was proved 
that the worst case requires fi(n2"/logn) gates from the NCT library [15], 
almost all these bounds are of the form 0(n2 n ) in the NCT library [9]. 

Recently, the first asymptotically optimal in NCT library synthesis algorithm 
was introduced with the gate complexity L{&) < 3n2 ra+4 / log 2 n of a reversible 
circuit in the worst case [18]. In Section 2, we briefly describe this cycle-based 
algorithm. Section 3 contains descriptions of the replacement rules from [17] 
and of a “moving and replacing” algorithm for reducing the gate complexity 
of a reversible circuit in NCT and GT libraries with the help of these rules. 
In Section 4, we discuss various approaches of reducing the gate complexity 
during the synthesis process. In Section 5, we introduce a novel technique 
for combining a cycle-based synthesis algorithm with a Reed-Muller-spectra- 
based one. Experimental results of benchmark functions synthesis are presented 
in Section 6; all new circuits were obtained with the help of our open source 
software ReversibleLogicGenerator [19] that implements all synthesis techniques 
described in this paper. All results we present here (except Section 2 and the 
first part of Section 3) are new. 

We use the following notation for a generalized Toffoli gate with negative 
control input lines. 

Definition 1. A generalized Toffoli gate TOF(I; J;t) = TOF(ii, ■ ■ ■ ,i r ',ji, • ■ ■ 

• • • ,j s ',t) is a reversible gate, which defines a transformation fi-j-t- B" —» B” 
as follows: 

fi-,J-,t(( x !>••* 1 x n)) = {xi, - ■ ■ ,x t ®x il A • • • A Xi r Ax h A ■ ■ ■ Axj e , - ■ - ,x n ) , 

where I = { ii, ■ • • , i r } is a set of indices of positive control input lines, J = 
{iii''' , js } is a set of indices of negative control input lines, and t is an index 
of a controlled output line, /OJ = 0, 

In the case of the absence of negative control input lines, a generalized 
Toffoli gate will be referenced as TOF(I;t ), and in the case when a generalized 
Toffoli gate has no control input lines at all, it will be referenced as TOF(t). In 
other words, TOF(t ) = TOF(0;0;f) and TOF(I;t) = TOF(J;0;f). Using this 
notation, we can refer to a NOT gate as TOF(a), to a CNOT gate as TOF(b; a) 
and to a C 2 NOT gate as TOF(b,c;a). 


2 Asymptotically optimal synthesis algorithm 

In [18] a cycle-based synthesis algorithm that can produce a reversible circuit 
with the asymptotically optimal in NCT library gate complexity for any even 
permutation on the set B™, was described. It is the first and currently (as far 
as we know) the only asymptotically optimal non-search synthesis algorithm for 
the NCT library. Our software [19] is based on it, so we are going to briefly 
describe the essence of the algorithm. 

Let’s consider an even permutation h G A(B"). The main idea is a decom¬ 
position of h into a product of transpositions in such a way that all of them can 
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be grouped by K independent transpositions 1 : 

h = G\ o G2 o • • • o Gt o h' , 

where Gi = (x 7 ; 1,y I;1 ) o ••• o (x I; ^,y ,;/<-) is an i-th group of K independent 
transpositions, x, :? , y,; j g B n and h' is a residual permutation. 

Using vectors of a group G^, we construct a matrix Ai as follows: 

Ai = [xjp y^i Xi,x yi,x] T • 

The matrix Ai is a 2IF x n binary matrix. If 2 2K < n, then some columns in 
it are equal to one another. These duplicated columns can be zeroed-out in the 
matrix, using CNOT gates, with the help of conjugation; this results in a new 
matrix A^\ 

Note that the matrix Ai defines a permutation 7 r* € S'(B n ) and every gate e 
from the NCT library defines a permutation h e G S(B n ), for which hj 1 = h e . 
Therefore, a conjugation of a permutation 7r by a permutation h e , denoted as 
7 T h “ = h~ l o ir o h e , corresponds to attaching the gate e to the front and back 
of a current sub-circuit. For example, if the first two columns in the matrix Ai 
are equal, we can zero-out the second column with the help of two TOF( 1; 2) 
gates. 

Next, we fix all pairwise distinct nonzero columns { Cj 1 , ■ • • , Cj d } in the ma¬ 
trix A ^ and choose an index of a controlled output t from the set { Ji, • • • ,jd }• 
After that we transform the matrix A^ to the canonical form with the 
help of conjugation, where an Z-th row, l is odd, differs from the (l + l)-th row 
only in t-th element. 

( 2 ) (3} 

And finally, we transform the matrix A\ ' to the final form A\ ; with the help 
of TOF(j) gates, where j £ { j\, ■ • • , j ( i }. In [18] it was proved that the matrix 
Af J> can be realized by the single gate TOF{{ 1, • • • , n } \ { ji, ■ • • , j c i }; t)- The 
gate can be represented as a composition of C 2 NOT gates if K > 1 (the number 
of independent transpositions in a group Gi). 

A synthesized reversible circuit ©, produced by the algorithm, has the gate 
complexity L(&) < 3n2 n+4 / log 2 n. if K = O) log 2 n — log 2 log 2 n — log 2 c/)(n )), 
where <f>(n) < n / log 2 n is an arbitrarily slowly growing function, and the gate 
complexity L{&) < 6n2 n , if K = 2. These results were proved in [18]. 

In our software [19], we can change the parameter K to achieve the best 
synthesis result in a particular case. But in practice, when the number of input 
lines in a circuit is large, it is almost always the best option to use K = [log 2 n] 
during the synthesis process. 

The time complexity of the synthesis algorithm is T(A) = 0(n2 n j log 2 n) in 
the worst case. 


3 Generalized replacement rules for gate com¬ 
positions 

One of the most widely used gate complexity reduction techniques is an applying 
gate compositions templates to a reversible circuit. For example, such templates 

hereinafter a multiplication of permutations is left-associative: (/ o fj)(x'j = gif(x)). 
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(c) 


Figure 1: Examples of independent gates: (a)-(b) NCT library specific; (c) GT 
library specific. 


were considered in [9]. This approach involves storing templates and finding 
them in a circuit. But we can interchange some adjacent gates of NCT and GT 
libraries in a reversible circuit without changing the resulting transformation, 
defined by the circuit. We call such gates independent. 

In [5] the necessary and sufficient conditions for the independence of two 
TOF(Ij\tj) gates were proved. However, for the gates from the GT library we 
can supplement these conditions. 

Lemma 1. Gates TOF(I]_\ and TOF^ff', £ 2 ) are independent iff at 

least one of the following condition holds (see Fig. 1): 

1. t\ U J 2 and t% £ I\ U J\ (in particular, t\ = t 2 ); 

2. 1 1 (~l J 2 ff 0 or I 2 n Ji 0. 

Proof of the Lemma 1 was partly given in [17]. Even though the first con¬ 
dition of gate independence was already known before [5] (see Fig. la-lb), the 
second one cannot be derived from it (see Fig. lc). 

In [3] rule-based optimization techniques based on Karnaugh maps for the 
optimization of sub-circuits with common targets were described. The main 
disadvantage of this approach is the restricted scalability for circuits with the 
large number of input lines. On the other hand, the advantage of using negative 
control Toffoli gates for the simplification of reversible circuits and reducing 
their quantum cost was shown by the authors. 

In [17] we proposed generalized replacement rules for the case of an arbi¬ 
trary number of input lines. Moreover, we were able to obtain a new rule for 
interchanging two gates with changing the polarity of a control line for one of 
these gates (see the last rule in Table 4). Our replacement rules are essentially 
templates of small length. But the advantage of using them is in changing the 
set of negative control input lines in a gate. This makes it possible to obtain 
independent gates instead of dependent ones in some cases, interchange or move 
them in a reversible circuit to new places and apply other replacement rules. 
“Moving and replacing” algorithm will be described later. 

A similar approach was used in [11], though replacement rules in that paper 
differ from ours. 
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Tabic 1: Representing a gate from the GT library as a composition of gates 



Let’s consider a composition of two dependent gates ei * e^. Let h ei and 
h e2 be permutations defined by them respectively. If we want to obtain an 
equal composition ©i * e\ = e± * e 2 , then the circuit ©i must implement the 
permutation hi = h e2 ei . And if we want to obtain an equal composition e 2*©2 = 
ei * e 2 , then the circuit ©2 must implement the permutation /12 = h e l 1 2 3 4 . 

All our replacement rules can be classified as follow: 

1. Representing a gate from the GT library as a composition of gates from 
the NCT library. 

2. Merging two gates into one. 

3. Reducing the negative control lines number. 

4. Interchanging two dependent gates. 

For the sake of clarity, we will organize the detailed description of our rules 
in the form of Tables 1-4, one for each rule “class”. The left column of the 
tables contains a gate composition before applying replacement rule, and the 
right column contains the result of the replacement. For every rule a picture 
goes first (for understanding the concept of a rule), then, a text description of 
the rule, and finally, a condition for applying the rule. 

Now we can describe the “moving and replacing” algorithm implemented in 
our software [19], which may reduce the gate complexity of a circuit. 

Let a reversible circuit © be a composition of l gates from the GT library: 

© = * ei. If a gate composition * e,- satisfies the condition of a replacement, 

i—1 

where i < j, and there is such an index s, i < s < j , that gates and e*, are 
independent for every i < k < s, and gates ej and e*, are independent for every 
s < k < j , then the gates ej and e :j can be removed from the circuit and a result 
of the replacement for e; * ej can be inserted between gates e s and e s +i. 
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Table 2: Merging two gates into one. 



So, the “moving and replacing” algorithm first searches a pair of gates, 
the composition of which satisfies the condition of a replacement. After that 
the algorithm checks if they can be moved to each other, using Lemma 1. If 
yes, it implements a replacement as described above. In the case, when the 
gate complexity is not reduced after replacement, but there are new gates in a 
circuit, the algorithm continues to work, until the gate complexity is reduced or 
there are no new gates. 


The time complexity of the proposed “moving and replacing” algorithm 
T(A) > R ■ l 2 , where R is the number of replacement rules, l is the gate com¬ 
plexity of an original circuit. It is almost the same as the time complexity of 
any template based optimization algorithm. At the same time, our “moving and 
replacing” algorithm seems to be more flexible than a template-based approach, 
because proposed replacement rules do not depend on the number of inputs in a 
reversible circuit, therefore there is no need to store a large number of templates 
and search them in a library. 
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Table 3: Reducing the negative control lines number. 



4 Boolean hypercube search 

Let’s consider the following permutation: 

h = ((1> 0,0,0,0), (1,0,1,0, l))o 
°((1> 0,0,0,1), (1,0,1,0,0))o 
°((1) 0,0,1,0), (1,0,1,1, l»o 

o«i, 0,0,1,1), <1,0,1,1,0)). 

As we can see, vectors in every transposition of permutation h presented above 
differ only in the 3-rd and 5-th coordinates. There are four transpositions total. 
Hence, a set of all vectors in these transpositions represents a Boolean 3-cube 
contained in a Boolean 5-cube B 5 . This 3-cube can also be denoted as 
(1, 0, *, *, *). Therefore, the permutation h can be implemented by a composi¬ 
tion of gates TOF(l- 2; 3) * TOF{ 1; 2; 5). 

Let’s assume we can represent a permutation h £ A(B") as a product of 
transpositions in such a way that a set of all vectors of first k transpositions 
in this product represents a Boolean (1 + log 2 fc)-cube. In the case, when we 
use only our cycle-based approach for the synthesis, we have to divide these 
k transpositions into groups and synthesize them separately. This approach 
can lead to significant gate complexity of a produced reversible circuit. On the 
other hand, any Boolean hypercube contained in B” can be implemented by a 
composition of no more than n generalized Toffoli gates TOF(I ; J;f). 

For example, a transformation / ((xi,X2, • ■ • , x n }) = (x\, x%®xi, 23 ,..., x n ) 
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Table 4: Interchanging two dependent gates. 



TOF{h-JvM)*TOF(h-J 2 -M) 


— 

•- 

>- 

n r 

[A 



3 k. 

J 




—€ 

3 -€ 



TOF(h ; J 3 ; t 2 ) * TOF(h , j 2 ; t 2 )* 
*TOF(I\\ 

h = h U / 2 \ { ti }, J3 = Ji U J 2 \ { t\ } 


Condition: gates are dependent, t\ £ I 2 U J 2 and t 2 £ h U J\. 




V 




TOF(I 1 -J 1 ;t 1 )*TOF(I 2 - : J 2 - : t 2 ) 




1 r 

A 

T 

J 

—e 

5— 



TOF(l i] J; i U 2 )*TOF(I l :J l :t 1 ) 
h = (h\{ h }) u (J 2 n { h }) 
J3 = (J2 \ {t \}) u (i 2 n {ti}) 


Condition: gates are dependent, I\ C I 2 and Ji C J 2 . 


can be implemented by a reversible circuit, produced by our main synthesis 
algorithm, with the gate complexity 0(n2 n ), and it can be implemented by a 
single gate TOF( 1; 2), because there is a Boolean 1-cube (1, *, ■ • • , *). 

It is obvious that searching a Boolean hypercube can take a significant 
amount of time and can be inefficient for large functions. But this approach 
makes it possible to obtain better synthesis results in some cases. 

4.1 Effective disjoints of cycles 

To find a larger Boolean hypercube, we should somehow effectively represent 
a permutation h as a product of specific transpositions. Let’s consider a per¬ 
mutation h = (a,b, c, e, /, g), where the Hamming distances d(a, e) = d(b,g) = 
d(c , /) = A and the Hamming distance for any other two elements of h is not 
equal to A. We have the two possible representations of h as a product of cycles: 

1 ■ h = (a, e) o (a,f,g) o ( e,b,c ). 

2 ■ h = ( b,g) o (c,/) o (a, b) o (,c,g ) o (e,/). 

We can see that in the first case only the cycle (a, e) has the two elements with 
the Hamming distance equal to A. But in the second case there are two cycles 
(&, g) and (c, /) that have the two elements with the Hamming distance equal 
to A. Therefore, we can assume that the set { b, g, c, / } may contain a larger 











































Boolean hypercube, compared to the set {a, e }, and we will call the second 
representation of h an effective disjoint of cycles. 

There is a simple linear algorithm for an effective disjoint of cycles of a 
permutation for a given Hamming distance A. In the first pass, the algorithm 
searches all pairs of elements in a cycle with the Hamming distance equal to A. 
In the second pass, the algorithm calculates for a found pair p how many other 
pairs would be broken, if we disjoint the cycle by the pair p. In the third pass, 
the algorithm chooses a pair p, for which the number of broken pairs is minimal. 
And finally, the algorithm disjoints the cycle by the chosen pair. After that we 
don’t have to repeat all steps for obtained cycles, because we can simply remove 
broken pairs and use previous results for further disjoints. 

For our example above, we have the three pairs with the Hamming distance 
equal to A: (a, e), (b,g) and (c,/). If we choose the pair (a, e), it would break 
two pairs (b,g) and (c, /). And if we choose either (b,g) or (c,/), they would 
break only the pair (a, e). Hence, an effective disjoint will be for the pair (b,g) 
or (c, /). It is not difficult to show that the proposed algorithm for an effective 
disjoint of cycles doesn’t depend on the order of elements in a cycle. The 
disjoint result will be the same for the permutation h = (a,b,c,e, f,g) and for 
the permutation h' = (c, e, /, 3 , a, b). 

The time complexity of a single disjoint operation for a cycle of length l is 
no more than 0(1 log 2 l ). 

4.2 Left and right multiplication 

Until now we used a left multiplication for a cycle disjoint. But we can also use 
a right multiplication. E. g., a cycle (a, &, c) can be represented in two ways for 
the transposition (a,b): 

1. Left multiplication: (a, b, c) = (a, b) o (a, c). 

2. Right multiplication: (a, b, c ) = ( 6 , c) o (a, b). 

We can see that the results of the multiplications are different. This difference 
can lead to significantly different synthesis results. 

There is no way to find out on an i-th step of our basic synthesis algorithm, 
whether the left or right multiplication would be the best in the end. The only 
thing we can do is to make both left and right multiplications on an i- th step 
and choose the one which leads to the greater permutation reduction and to the 
lower gate complexity of a current reversible circuit. This approach doubles the 
synthesis time, but it also leads to better reversible circuits in some cases. 

And finally, another area for optimizations is the constructing of a bijec- 
tive transformation for a given non-bijective one. We believe that in terms of 
reversible logic synthesis the best result can be achieved, when this bijective 
transformation has minimal Hamming distances between inputs and outputs. 


5 Combining cycle-based and RM-spectra based 
algorithms 

In [13] a hybrid framework was proposed, which combines a cycle-based and 
a RM-spectra based algorithms. Unfortunately, this combination is only the 
choice of a better reversible circuit synthesized by one or another algorithm. 
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We propose a new approach for combining a cycle-based and a RM-spectra 
based algorithms. In [9] a RM-spectra based synthesis algorithm was described. 
For a reversible specification /: B" —>• B ra the algorithm successively transforms 
the truth table T n to the truth table that corresponds to the identity transfor¬ 
mation. This is done by changing an i-th row in T n , for which T n [i] ^ i. to the 
form T n [i\ = i for every i = 0, • • ■ , (2™ — 1). Every row j < i is not changed 
after a transformation of an i-th row: T n [j] = j. 

Our combining approach allows us to modify the truth table T n in such a 
way that for the first row i, for which T n [i\ ^ i , T n [j] = j, j < i, each row k < i 
in a modified truth table T' n will be equal to itself: T' n [k] = k. 

On an i-th step of the RM-spectra based synthesis algorithm from [9] a 
reversible circuit © which we synthesize is of the form: 


<3 = &i * ©T„ * &r , 


where sub-circuits ©; and © r were constructed on the previous steps and a 
circuit &T n is unknown, it implements a transformation described by the truth 
table T n , for which T n [j] = j, j < i, T n [i\ ^ i. The original RM-spectra based 
synthesis algorithm appends gates to the ©; or © r after modifying the i-th row 
in T n . 

Let h , hi, ht , h r be the permutations, defined by the circuits ©, ©;, &r n 
and © r respectively. This implies that 


h = hi o ht o h r . 


Let’s assume T n [i] = k and T n [l] = i, where k,l > i. We can state that 

h =hi o h' t o (i, k) o h r = hi o h' t o h r o (i, k) hr , 
h =hi o (i, l) o h' t o h r = (i, l) h ‘ o hi o h' t o h r , 

where a permutation h' t is defined by the truth table T' n , T' t [j] = j for every 

j < i- 

From this it follows that we can “push” one transposition (i, k) or (i, l) from 
the permutation h t to the right or to the left, conjugate it by the permutation 
h r or hj 1 respectively and “skip” the transformation of the i-th row in the truth 
table T n by the original RM-spectra based synthesis algorithm. After that we 
can move to the next row and repeat this process. 

After a RM-spectra based synthesis algorithm finishes its work, we can use a 
cycle-based synthesis algorithm to synthesize pushed transpositions. There are 
several approaches to decide, whether an i-th row is pushed or not and where 
it will be pushed (left or right). For example, we can push an i-th row only 
when the Hamming weight of i is greater or equal to a predefined threshold w. 
It is equivalent to processing all monomials of degree d < w in a Reed-Muller 
polynomial with a RM-spectra based synthesis algorithm. All other monomials 
will be processed by a cycle-based synthesis algorithm. 

We realized the proposed combining approach in our software [19] with the 
ability to choose a “push policy” and a weight threshold. This allowed us to 
find a reversible circuit implementing rd53 function with 7 inputs and with the 
gate complexity equal to 11 in the GT library (see Fig. 2). The weight threshold 
was equal to one during the synthesis process. 
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Figure 2: Realization of rd53 function in a reversible circuit with 7 inputs and 
11 gates in the GT library. 


Table 5: Benchmark functions synthesis (new circuits with less input count). 


Function 

New circuits 

Existing circuits 

lines™ in 

GC 

QC 

T-count 

lines™ in 

GC 

QC 

T-count 

gf2~3mult 

7 

73 

740 

632 

9 

11 

47 

63 

gf2~3mult 

7 

79 

712 

632 

gf2~3mult 

7 

145 

704 

654 

gf2~4mult 

9 

415 

47649 

10838* 

12 

19 

83 

112 

gf2~4mult 

9 

1834 

5914 

5156 

nth_prime9_inc 

9 

3942 

19313 

15234 

10 

7522 

17975 

14193 

rd73 

9 

296 

43421 

8765* 

10 

20 

64 

98 

rd73 

9 

835 

4069 

3521 

rd84 

11 

679 

359384 

25364* 

15 

28 

98 

147 

rd84 

11 

2560 

12397 

8772* 


an ancillary line is required 


6 Experimental results 

We developed an open source software ReversibleLogicGenerator [19], which 
implements the basic cycle-based synthesis algorithm from [18] and all the gate 
complexity reduction techniques, described in this paper. Using our software, we 
conducted a series of experiments on reversible benchmark functions synthesis. 
The results are presented in Tables 5-7. The synthesis time in the worst case 
was a matter of seconds. 

We were able to obtain more than 40 new reversible circuits, which have 
less input count, less gate complexity or less quantum cost compared to existing 
circuits. All specifications for benchmark functions and their names were taken 
from the Reversible Logic Synthesis Benchmarks Page [7] and from the RevLib 
site [16]. We use the following conventions in the tables: lines is the number of 
inputs in a reversible circuit ©, GC is the gate complexity L(S) of this circuit 
and QC is its quantum cost W(&)\ T-count is the number of T gates in a 
decomposition of the circuit into Clifford-1-T gates. 

The quantum cost of obtained circuits was calculated with the help of the 
software RCViewer+ [2], Its calculation is based on the paper [6], according to 
which a generalized Toffoli gate with negative control lines may have the same 


11 















































Table 6: Benchmark functions synthesis (new circuits with less gate complexity). 


Function 

New circuits 

Existing circuits 

lines 

GC min 

QC 

T-count 

lines 

GC min 

QC 

T-count 

2of5 

6 

9 

268 

191* 

6 

15 

107 

119 

2of5 

6 

10 

118 

135 

2of5 

7 

11 

32 

42 

7 

12 

32 

49 

3.17 

3 

4 

14 

14 

3 

6 

12 

14 

3.17 

3 

5 

13 

14 

4bl5g_2 

4 

12 

57 

55* 

4 

15 

31 

35 

4bl5g 4 

4 

12 

49 

45* 

4 

15 

35 

31* 

4bl5g 4 

4 

14 

47 

45* 

4bl5g 5 

4 

14 

72 

54* 

4 

15 

29 

42 

4mod5 

5 

4 

13 

14 

5 

5 

7 

7 

5mod5 

6 

7 

429 

294* 

6 

8 

84 

* 

o 

r- 

6 sym 

7 

14 

1308 

628* 

7 

36 

777 

741 

6 sym 

7 

15 

825 

624* 

9sym 

10 

73 

61928 

7004* 

10 

129 

6941 

5484* 

9sym 

10 

74 

31819 

6788* 

ham7 

7 

19 

77 

85 

7 

25 

49 

42 

hwbl2 

12 

42095 

134316 

98482 

12 

55998 

198928 

134131 

nth prime7 inc 

7 

427 

10970 

5403* 

7 

1427 

3172 

2837 

nt h.pr ime 7 inc 

7 

474 

10879 

5403* 

nth prime7 Jnc 

7 

824 

2269 

1906 

nth prime8 inc 

8 

977 

10218 

7359* 

8 

3346 

7618 

5985 

nth prime8 inc 

8 

1683 

6330 

5213 

nth prime9 inc 

10 

2234 

22181 

17292 

10 

7522 

17975 

14193 

nth primelO inc 

11 

5207 

50152 

38261 

11 

16626 

40299 

30315 

nth.primel 1 _inc 

12 

11765 

124408 

92937 

12 

35335 

95431 

68255 

rd53 

7 

11 

96 

100 

7 

12 

120 

124 


an ancillary line is required 
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Table 7: Benchmark functions synthesis (new circuits with less quantum cost). 


Function 

New circuits 

Existing circuits 

lines 

GC 

QC min 

T-count 

lines 

GC 

QC min 

T-count 

2of5 

7 

12 

31 

42 

7 

12 

32 

49 

6sym 

7 

41 

206 

184 

7 

36 

777 

741 

9sym 

10 

347 

1975 

1680 

10 

210 

4368 

4368 

hwb7 

7 

603 

1728 

1400 

7 

331 

2611 

2245* 

hwb8 

8 

1594 

4852 

3748 

8 

2710 

6940 

5201 

hwb9 

9 

3999 

12278 

10220 

9 

6563 

16173 

12150 

hwblO 

10 

8247 

26084 

20368 

10 

12288 

35618 

25939 

hwbll 

11 

21432 

69138 

52922 

11 

32261 

90745 

63430 

hwbl2 

12 

42095 

134316 

98482 

12 

55998 

198928 

134131 

nth prime7 inc 

7 

824 

2269 

1906 

7 

1427 

3172 

2837 

nth_prime8_inc 

8 

1683 

6330 

5213 

8 

3346 

7618 

5985 

rd53 

7 

12 

82 

92 

7 

12 

120 

124 

rd53 

7 

12 

95 

100 

rd53 

7 

11 

96 

100 


an ancillary line is required 


quantum cost as the corresponding generalized Toffoli gate without negative 
control lines. Also we included the T-count cost measure for all circuits in the 
tables (cost calculation was based on the paper [8]). Despite the fact that this 
cost measure is very popular for fault-tolerant circuits in the literature, it is not 
universal in the case of limited ancillary lines availability. According to [8] , there 
is a circuit of Toffoli gates that cannot be implemented, using Clifford-1-T gates, 
without an ancillary line. We marked such T-count metrics by the asterisk 
symbol in the tables. 

Tables 5, 6 and 7 contain results for obtained reversible circuits with less 
input count (column lines m i n ), less gate complexity (column GC m i n ) and less 
quantum cost (column QC m i n ) compared to existing circuits respectively. Since 
we compare our circuits only with circuits consisting of gates from NCT and GT 
libraries and since the NCT library is a part of the GT library, such comparison 
made by us is correct. 

We have not included circuits with more than 12 inputs in the tables just 
because of the limited format of the paper. One can easily synthesize such 
circuits with the help of our software. 

With the help of developed software we were able to find a reversible circuit 
with 7 input lines and with 11 gates from the GT library for one of the most 
popular benchmark functions rd53 (see Fig. 2). This circuit and all other 
circuits described in the tables above can be freely downloaded in TFC and 
REAL formats from the cite [19] as well as ReversibleLogicGenerator software 
itself. 
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7 Conclusion 


A reversible circuit with n inputs necessarily defines a permutation from the 
symmetric group S(B"). Permutations that correspond to all the gates NOT, 
CNOT and C 2 NOT generate the alternating group A(B") if n > 3, and per¬ 
mutations that correspond to all the gates C k NOT, 1 < k < n generate the 
symmetric group. This implies that we can use the permutation group theory 
to successfully synthesize a reversible circuit for a given reversible specification. 

In the paper, we briefly described the first asymptotically optimal in NCT li¬ 
brary synthesis algorithm, based on the permutation group theory, which makes 
it possible to obtain a reversible circuit without additional inputs. We also sug¬ 
gested the “moving and replacing” algorithm for gate complexity reduction for 
circuits consisting of the gates from the GT library; the algorithm is based on 
equivalent replacements of gate compositions and on conditions of independence 
for the gates with negative control lines. 

We described some gate complexity reduction techniques that use the permu¬ 
tation group theory. Among them are the search of a Boolean hypercube and 
an effective cycle disjoint. We presented experimental results for benchmark 
functions synthesis, which include more than 40 reversible circuits consisting 
of gates from the GT library, obtained with the help of developed open source 
software that implements all described techniques. 

We believe that the permutation group theory may allow us to obtain better 
reversible circuits for all benchmark functions, and we hope that this paper will 
motivate other researchers to improve our results. 
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